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Gefitinib Sensitivity-Related Gene Expression 
and Products and Methods Related Thereto 



Background of the Invention 

Lung Cancer is the leading cause of death from cancer worldwide. Chemotherapy is the 
mainstay of treatment for lung cancer. However, less than a third of patients with advanced stages 
of non-small cell lung cancer (NSCLC) respond to the best two chemotherapy drug combinations. 
Therefore, novel agents that target cancer specific biological pathways are needed. 

The epidermal growth factor receptor (EGFR) is one of the most appealing targets for novel 
therapies for cancer. EGFR plays a major role in transmitting stimuli that lead to proliferation, 
growth and survival of various cancer types, including, but not limited to, NSCLC. Ligand binding 
to the EGFR receptor leads to homo- or heterodimerization of EGFR with other ErbB receptors. 
EGFR is overexpressed in a large proportion of invasive NSCLC and in premalignant bronchial 
lesions. Bronchioloalveolar carcinoma (BAC), a subtype of non-small cell lung cancer, represents 
the major form of lung cancer in non-smoking females and is rising in frequency, and epidermal 
growth factor receptor (EGFR) is expressed with high frequency in BAC. Unfortunately, the 
response of BACs to conventional chemotherapy is poor. Activation of the EGFR receptor leads to 
simultaneous activation of several signaling cascades including the MAPK pathway, the protein 
kinase C (PKC) pathway and the PI(3)K-activated AKT pathway. EGFR signaling translated in the 
nucleus leads to cancer cell proliferation and survival. 

Targeted therapy against the EGFR receptor has produced response rates of 25-30% as first 
line treatment and 11-20% in 2 nd and 3 rd line settings (e.g., chemo-refractory advanced stage 
NSCLC). For example, in phase II clinical trials, 11-20% of patients with chemo-refractory 
advanced stage NSCLC responded to treatment with the EGFR tyrosine kinase inhibitor gefitinib 
(commercially available as Iressa®, ZD 1 839). A trial evaluating the activity of the EGFR inhibitor, 
erlotinib (Tarceva®, OSI-774) has been completed and the results will be reported in the near future. 
A retrospective analysis of 140 patients responding to treatment with gefitinib revealed that the 
presence of BAC features (p=0.005) and being a never smoker (p=0.007) were the only independent 
predictors of response to gefitinib. These data suggest that EGFR inhibitor therapy is more active 
in BAC and in non-smokers. 



However, currently, there are no selection criteria for determining which NSCLC patients 
will benefit from treatment with EGFR inhibitors such as gefitinib. Moreover, EGFR expression 
does not predict gefitinib sensitivity. Therefore, despite the correlation of tumor histology and 
smoking history with gefitinib response, it is of great importance to identify molecular molecules 
that influence gefitinib responsiveness, and to develop adjuvant treatments that enhance the 
response. To accomplish this goal, there is a need in the art to define critical aspects of EGFR 
signaling and to identify which molecules interact with the EGFR pathway to dictate responsiveness 
to EGFR inhibitors. 

Description of t he Invention 
The present invention generally relates to the identification, provision and use of a panel of 
biomarkers that predict sensitivity or resistance to gefitinib, and products and processes related 
thereto. Specifically, the present inventors have used NSCLC cell lines with varying sensitivity to 
the EGFR inhibitor, gefitinib, to define the novel panel of biomarkers as described herein. In order 
to identify a marker panel that could be used for selection of NSCLC patients who will respond to 
gefitinib treatment, the inventors undertook preclinical in vitro studies using NSCLC cell lines. 
Based on the therapeutic response to gefitinib by using the IC 50 definition (i.e., the concentration 
of agent needed to kill 50% of the tumor cells in a cell culture), the present inventors have classified 
the cell lines as sensitive (IC5o<l p.M), resistant (>10 |iM), or having intermediate sensitivity (1 \iM 
<IC50< 10 |iM) to gefitinib. The cell lines were characterized by gene microarray analysis 
(Affymetrix™ microarray HGU 133A, 22.000 genes). By comparing the gene microarray results 
from sensitive and resistant cell lines, the inventors have identified a panel of genes that can 
discriminate between sensitive and resistant cell lines. These biomarkers (i.e., the genes identified) 
will be of great clinical significance in selecting NSCLC patients/human tumors which will respond 
to this agent. The biomarkers identified by the present invention, and their expression levels in 
gefitinib sensitive and resistant cells, are listed in Table 1. It is to be understood that the present 
invention expressly covers additional genes that can be elucidated using substantially the same 
techniques used to identify the genes in Table 1 and that any of such additional genes can be used 
in the methods and products described herein for the genes in Table 1 . Any reference to database 
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Accession numbers or other information regarding the genes in Table 1 is hereby incorporated by 
reference in its entirety. 

In addition, the present invention wall also be useful for the validation in other studies of the 
clinical significance of many of the specific biomarkers described herein, as well as the identification 
of preferred biomarker profiles, highly sensitive biomarkers, and targets for the design of novel 
therapeutic products and strategies. The biomarkers described herein are particularly useful in 
clinical practice to select the patients who will benefit most from EGFR inhibitor treatment and 
specifically, from gefitinib treatment. 

The present inventors have already used the biomarkers described herein to identify specific 
targets for the further development of diagnostic and therapeutic approaches used in cancer. For 
example, E-cadherin is a calcium-dependent epithelial cell adhesion molecule that plays an important 
role in tumor invasiveness and metastatic potential. Reduced E-cadherin expression is associated 
with tumor cell dedifferentiation, advanced stage and reduced survival in patients with NSCLC. 
Using Western blot analysis, E-cadherin was expressed in three cell lines highly sensitive to gefitinib 
and its expression was lacking in six gefitinib resistant cell lines tested. High-density 
oligonucleotide microarrays were used to evaluate the gene expression pattern in 1 1 NSCLC cell 
lines and compared to gene expression in normal bronchial epithelium. E-cadherin expression was 
elevated in cell lines sensitive to gefitinib and downregulated in the resistant cell lines as compared 
to the normal bronchial epithelium. These results were confirmed with real-time RT-PCR. The 
expression of E-cadherin is regulated by zinc finger inhibitory proteins by the recruitment of histone 
deacetylases (HDAC). Using real-time RT-PCR, the expression of the two zinc-finger transcription 
factors, 5EF1/ZEB1 and SIP1/ZEB2, involved in E-cadherin repression was evaluated. Results 
showed that both ZEB 1 and SIP 1 were expressed in gefitinib resistant cell lines and their expression 
was lacking in gefitinib sensitive cell lines. This expression pattern of ZEB1 and SIP1 was 
confirmed in the microarray analysis. The present inventors have also found that 8EF1/ZEB1 and 
SIP1/ZEB2 may regulate Her3, which is an EGFR heterodimer. These data indicate that the 
expression of the zinc finger containing proteins SIP1 and ZEB1 may predict resistance to EGFR 
tyrosine kinase inhibitors and future studies directed at modulating the regulation of E-cadherin 
expression are expected to enhance the activity of EGFR inhibitors in NSCLC. 
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Finally, the present invention also relates to protein profiles which can discriminate between 
sensitive and resistant NSCLC tumors. 

Prior to the present invention, to the best of the present inventors' knowledge, no single 
marker, or marker panel, has been demonstrated to be useful for selection of lung cancer patients 
who will benefit from EGFR inhibitor, and particularly, gefitinib, treatment. Nor are there any such 
markers (related to EGFR inhibitors) identified for other types of cancer. 

Accordingly, using the gene expression profiles disclosed in Table 1 for gefitinib sensitive 
and resistant cells, one can rapidly, effectively and efficiently screen patients/human tumors for a 
level of sensitivity or resistance to gefitinib and also to other EGFR inhibitors having biological 
activity substantially similar to gefitinib (i.e., drugs having similar activities, gefitinib agonists and 
other derivatives). The results will allow for the identification of tumors/patients that are likely to 
benefit from administration of the drug and therefore, the genes are used to enhance the ability of 
the clinician to develop prognosis and treatment protocols for the individual patient. In addition, 
genes identified in Table 1 can be further validated as targets and then used in assays to identify 
therapeutic reagents useful for regulating the expression or activity of the target in a manner that 
improves sensitivity of a cell to gefitinib or analogs thereof. The knowledge provided from the 
expression profile of genes described herein and the identification additional genes using similar 
methods can also be used to identify the molecular mechanisms of EGFR inhibition, such knowledge 
being useful for the further development of new therapies and even analogs of gefitinib or other 
EGFR inhibitors with improved efficacies in cancer treatment. Moreover, given the knowledge of 
these genes, one can produce novel combinations of polynucleotides and/or antibodies and/or 
peptides for use in the various assays, diagnostic and/or therapeutic approaches described herein. 

Various definitions and aspects of the invention will be described below, but the invention 
is not limited to any specific embodiments that may be used for illustrative or exemplary purposes. 

According to the present invention, in general, the biological activity or biological action of 
a protein refers to any function(s) exhibited or performed by the protein that is ascribed to the 
naturally occurring form of the protein as measured or observed in vivo (i.e., in the natural 
physiological environment of the protein) or in vitro (i.e., under laboratory conditions). 
Modifications of a protein, such as in a homologue or mimetic (discussed below), may result in 
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proteins having the same biological activity as the naturally occurring protein, or in proteins having 
decreased or increased biological activity as compared to the naturally occurring protein. 
Modifications which result in a decrease in protein expression or a decrease in the activity of the 
protein, can be referred to as inactivation (complete or partial), down-regulation, or decreased action 
of a protein. Similarly, modifications which result in an increase in protein expression or an increase 
in the activity of the protein, can be referred to as amplification, overproduction, activation, 
enhancement, up-regulation or increased action of a protein. 

According to the present invention, a "downstream gene" or "endpoint gene" is any gene, the 
expression of which is regulated (up or down) within a gefitinib sensitive or resistant cell. Selected 
sets of one, two, and preferably several or many of the genes (up to the number equivalent to all of 
the genes) of this invention can be used as end-points for rapid screening of patient cells for 
sensitivity or resistance to EGFR inhibitors such as gefitinib and for the other methods as described 
herein, including the identification of novel targets for the development of new cancer therapeutics. 

As used herein, the term "homologue" is used to refer to a protein or peptide which differs 
from a naturally occurring protein or peptide (i.e., the "prototype" or "wild-type" protein) by minor 
modifications to the naturally occurring protein or peptide, but which maintains the basic protein and 
side chain structure of the naturally occurring form. Such changes include, but are not limited to: 
changes in one or a few amino acid side chains; changes one or a few amino acids, including 
deletions (e.g., a truncated version of the protein or peptide) insertions and/or substitutions; changes 
in stereochemistry of one or a few atoms; and/or minor derivatizations, including but not limited to: 
methylation, glycosylation, phosphorylation, acetylation, myristoylation, prenylation, palmitation, 
amidation and/or addition of glycosylphosphatidyl inositol. A homologue can have either enhanced, 
decreased, or substantially similar properties as compared to the naturally occurring protein or 
peptide. A homologue can include an agonist of a protein or an antagonist of a protein. 

Homologues can be the result of natural allelic variation or natural mutation. A naturally 
occurring allelic variant of a nucleic acid encoding a protein is a gene that occurs at essentially the 
same locus (or loci) in the genome as the gene which encodes such protein, but which, due to natural 
variations caused by, for example, mutation or recombination, has a similar but not identical 
sequence. Allelic variants typically encode proteins having similar activity to that of the protein 
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encoded by the gene to which they are being compared. One class of allelic variants can encode the 
same protein but have different nucleic acid sequences due to the degeneracy of the genetic code. 
Allelic variants can also comprise alterations in the 5' or 3' untranslated regions of the gene (e.g., in 
regulatory control regions). Allelic variants are well known to those skilled in the art. 

An agonist, as used herein, is a compound that is characterized by the ability to agonize (e.g. , 
stimulate, induce, increase, enhance, or mimic) the biological activity of a naturally occurring or 
reference protein or compound. More particularly, an agonist can include, but is not limited to, a 
compound, protein, peptide, or nucleic acid that mimics or enhances the activity of the natural or 
reference compound, and includes any homologue, mimetic, or any suitable product of 
drug/compound/peptide design or selection which is characterized by its ability to agonize (e.g., 
stimulate, induce, increase, enhance) the biological activity of a naturally occurring or reference 
compound. 

An antagonist refers to any compound which inhibits (e.g., antagonizes, reduces, decreases, 
blocks, reverses, or alters) the effect of a naturally occurring or reference compound as described 
above. More particularly, an antagonist is capable of acting in a manner relative to the activity of 
the reference compound, such that the biological activity of the natural or reference compound, is 
decreased in a manner that is antagonistic (e.g., against, a reversal of, contrary to) to the natural 
action of the reference compound. Such antagonists can include, but are not limited to, any 
compound, protein, peptide, or nucleic acid (including ribozymes and antisense) or product of 
drug/compound/peptide design or selection that provides the antagonistic effect. 

Agonists and antagonists that are products of drug design can be produced using various 
methods known in the art. Various methods of drug design, useful to design mimetics or other 
compounds useful in the present invention are disclosed in Maulik et al., 1997, Molecular 
Biotechnology: Therapeutic Applications and Strategies, Wiley-Liss, Inc., which is incorporated 
herein by reference in its entirety. An agonist or antagonist can be obtained, for example, from 
molecular diversity strategies (a combination of related strategies allowing the rapid construction of 
large, chemically diverse molecule libraries), libraries of natural or synthetic compounds, in 
particular from chemical or combinatorial libraries (i.e., libraries of compounds that differ in 
sequence or size but that have the similar building blocks) or by rational, directed or random drug 
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design. See for example, Maulik et al., supra. 

In a molecular diversity strategy, large compound libraries are synthesized, for example, from 
peptides, oligonucleotides, natural or synthetic steroidal compounds, carbohydrates and/or natural 
or synthetic organic and non-steroidal molecules, using biological, enzymatic and/or chemical 
approaches. The critical parameters in developing a molecular diversity strategy include subunit 
diversity, molecular size, and library diversity. The general goal of screening such libraries is to 
utilize sequential application of combinatorial selection to obtain high-affinity ligands for a desired 
target, and then to optimize the lead molecules by either random or directed design strategies. 
Methods of molecular diversity are described in detail in Maulik, et al., ibid. 

As used herein, the term "mimetic" is used to refer to any natural or synthetic compound, 
peptide, oligonucleotide, carbohydrate and/or natural or synthetic organic molecule that is able to 
mimic the biological action of a naturally occurring or known synthetic compound. 

As used herein, the term "putative regulatory compound" or "putative regulatory ligand" 
refers to compounds having an unknown regulatory activity, at least with respect to the ability of 
such compounds to regulate the expression or biological activity of a gene or protein encoded 
thereby, or to regulate sensitivity or resistance to an EGFR inhibitor as encompassed by the present 
invention. 

In accordance with the present invention, an isolated polynucleotide, or an isolated nucleic 
acid molecule, is a nucleic acid molecule that has been removed from its natural milieu (i.e., that has 
been subject to human manipulation), its natural milieu being the genome or chromosome in which 
the nucleic acid molecule is found in nature. As such, "isolated" does not necessarily reflect the 
extent to which the nucleic acid molecule has been purified, but indicates that the molecule does not 
include an entire genome or an entire chromosome in which the nucleic acid molecule is found in 
nature. Polynucleotides useful in the plurality of polynucleotides of the present invention (described 
below) are typically a portion of a gene of the present invention that is suitable for use as a 
hybridization probe or PCR primer for the identification of a full-length gene (or portion thereof) in 
a given sample (e.g., a cell sample). An isolated nucleic acid molecule can include a gene or a 
portion of a gene (e.g., the regulatory region or promoter), for example, to produce a reporter 
construct according to the present invention. An isolated nucleic acid molecule that includes a gene 
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is not a fragment of a chromosome that includes such gene, but rather includes the coding region and 
regulatory regions associated with the gene, but no additional genes naturally found on the same 
chromosome. An isolated nucleic acid molecule can also include a specified nucleic acid sequence 
flanked by (i.e., at the 5' and/or the 3' end of the sequence) additional nucleic acids that do not 
normally flank the specified nucleic acid sequence in nature (i.e., heterologous sequences). Isolated 
nucleic acid molecule can include DNA, RNA (e.g., mRNA), or derivatives of either DNA or RNA 
(e.g., cDNA). Although the phrase "nucleic acid molecule" primarily refers to the physical nucleic 
acid molecule and the phrase "nucleic acid sequence" primarily refers to the sequence of nucleotides 
on the nucleic acid molecule, the two phrases can be used interchangeably, especially with respect 
to a nucleic acid molecule, or a nucleic acid sequence, being capable of encoding a protein. 
Preferably, an isolated nucleic acid molecule of the present invention is produced using recombinant 
DNA technology (e.g., polymerase chain reaction (PCR) amplification, cloning) or chemical 
synthesis. If the polynucleotide is an oligonucleotide probe, the probe preferably ranges from about 
5 to about 50 or about 500 nucleotides, more preferably from about 10 to about 40 nucleotides, and 
most preferably from about 15 to about 40 nucleotides in length. 

An isolated protein, according to the present invention, is a protein (including a peptide) that 
has been removed from its natural milieu (i.e., that has been subject to human manipulation) and can 
include purified proteins, partially purified proteins, recombinantly produced proteins, and 
synthetically produced proteins, for example. As such, "isolated" does not reflect the extent to which 
the protein has been purified. An isolated protein useful as an antagonist or agonist according to the 
present invention can be isolated from its natural source, produced recombinantly or produced 
synthetically. Smaller peptides useful as regulatory peptides are typically produced synthetically by 
methods well known to those of skill in the art. 

According to the present invention, the phrase "selectively binds to" refers to the ability of 
an antibody, antigen binding fragment or binding partner (antigen binding peptide) to preferentially 
bind to specified proteins. More specifically, the phrase "selectively binds" refers to the specific 
binding of one protein to another (e.g., an antibody, fragment thereof, or binding partner to an 
antigen), wherein the level of binding, as measured by any standard assay (e.g., an immunoassay), 
is statistically significantly higher than the background control for the assay. For example, when 
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performing an immunoassay, controls typically include a reaction well/tube that contain antibody 
or antigen binding fragment alone (i.e. , in the absence of antigen), wherein an amount of reactivity 
(e.g., non-specific binding to the well) by the antibody or antigen binding fragment thereof in the 
absence of the antigen is considered to be background. Binding can be measured using a variety of 
methods standard in the art including enzyme immunoassays (e.g., ELISA), immunoblot assays, 
etc.). 

In some embodiments of the present invention, a compound is contacted with one or more 
nucleic acids or proteins. Such methods can include cell-based assays, or non-cell-based assay. In 
one embodiment, a target gene is expressed by a cell (i.e., a cell-based assay). In one embodiment, 
the conditions under which a cell expressing a target is contacted with a putative regulatory 
compound, such as by mixing, are conditions in which the expression or biological activity of the 
target (gene or protein encoded thereby) is not stimulated (activated) if essentially no regulatory 
compound is present. For example, such conditions include normal culture conditions in the absence 
of a known activating compound or other equivalent stimulus. The putative regulatory compound 
is then contacted with the cell. In this embodiment, the step of detecting is designed to indicate 
whether the putative regulatory compound alters the expression and/or biological activity of the gene 
or protein target as compared to in the absence of the putative regulatory compound (i.e., the 
background level). 

In accordance with the present invention, a cell-based assay as described herein is conducted 
under conditions which are effective to screen for regulatory compounds or to profile gene 
expression as described in the methods of the present invention. Effective conditions include, but 
are not limited to, appropriate media, temperature, pH and oxygen conditions that permit the growth 
of the cell that expresses the receptor. An appropriate, or effective, medium is typically a solid or 
liquid medium comprising growth factors and assimilable carbon, nitrogen and phosphate sources, 
as well as appropriate salts, minerals, metals and other nutrients, such as vitamins. Culturing is 
carried out at a temperature, pH and oxygen content appropriate for the cell. Such culturing 
conditions are within the expertise of one of ordinary skill in the art. 

Cells that are useful in the cell-based assays of the present invention include any cell that 
expresses a gene that is to be investigated as a target, or in the diagnostic assays described herein, 
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any cell that is isolated from a patient, including normal or malignant (tumor) cells. 

According to the present invention, the method includes the step of detecting the expression 
of at least one, and preferably more than one, and most preferably, several, of the genes that have 
now been shown to be regulated differently in gefitinib-sensitive versus gefitinib-resistant cells by 
the present inventors. As used herein, the term "expression" , when used in connection with detecting 
the expression of a gene, can refer to detecting transcription of the gene and/or to detecting 
translation of the gene. To detect expression of a gene refers to the act of actively determining 
whether a gene is expressed or not. This can include determining whether the gene expression is 
upregulated as compared to a control, downregulated as compared to a control, or unchanged as 
compared to a control. Therefore, the step of detecting expression does not require that expression 
of the gene actually is upregulated or downregulated, but rather, can also include detecting that the 
expression of the gene has not changed (i.e., detecting no expression of the gene or no change in 
expression of the gene). 

The present method includes the step of detecting the expression of at least one gene set forth 
in Table 1 . In a preferred embodiment, the step of detecting includes detecting the expression of at 
least 2 genes, and preferably at least 3 genes, and more preferably at least 4 genes, and more 
preferably at least 5 genes, and more preferably at least 6 genes, and more preferably at least 7 genes, 
and more preferably at least 8 genes, and more preferably at least 9 genes, and more preferably at 
least 10 genes, and more preferably at least 11 genes, and more preferably at least 12 genes, and 
more preferably at least 1 3 genes, and more preferably at least 1 4 genes, and more preferably at least 
15 genes, and so on, in increments of one, up to detecting expression of all of the genes disclosed 
herein in Table 1 . Preferably, larger numbers of genes in Table 1 are detected, as this will increase 
the sensitivity of the detection method. Analysis of a number of genes greater than 1 can be 
accomplished simultaneously, sequentially, or cumulatively. 

In one aspect, it may be desirable to select those genes for detection that are particularly 
highly regulated in gefitinib-sensitive cells versus gefitinib-resistant cells in that they display the 
largest increases or decreases in expression levels. The detection of such genes can be advantageous 
because the endpoint may be more clear and require less quantitation. The relative expression levels 
of the genes identified in the present invention are listed in Table 1 , and the genes are ranked in the 
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Table. Therefore, one can easily select subsets of particularly highly regulated genes, or subsets of 
genes based on some other desired characteristic to provide a more robust, sensitive, or selective 
assay. In one embodiment, one of skill in the art might choose to detect genes that exhibited a fold 
increase above background of at least 2. In another embodiment, one of skill in the art might choose 
to detect genes that exhibited a fold increase or decrease above background of at least 3, and in 
another embodiment at least 4, and in another embodiment at least 5, and in another embodiment 
at least 6, and in another embodiment at least 7, and in another embodiment at least 8, and in another 
embodiment at least 9, and in another embodiment at least 1 0 or higher fold changes. It is noted that 
fold increases or decreases are not typically compared from one gene to another, but with reference 
to the background level for that particular gene. 

In one aspect of the method of the present invention, the step of detecting can include the 
detection of expression of one or more of the genes of this invention. Expression of transcripts 
and/or proteins is measured by any of a variety of known methods in the art. For RNA expression, 
methods include but are not limited to: extraction of cellular mRNA and northern blotting using 
labeled probes that hybridize to transcripts encoding all or part of one or more of the genes of this 
invention; amplification of mRNA expressed from one or more of the genes of this invention using 
gene-specific primers and reverse transcriptase - polymerase chain reaction, followed by quantitative 
detection of the product by any of a variety of means; extraction of total RNA from the cells, which 
is then labeled and used to probe cDNAs or oligonucleotides encoding all or part of the genes of this 
invention, arrayed on any of a variety of surfaces. 

Methods to measure protein expression levels of selected genes of this invention, include, 
but are not limited to: western blotting, immunocytochemistry, flow cytometry or other 
immunologic-based assays; assays based on a property of the protein including but not limited to 
DNA binding, ligand binding, or interaction with other protein partners. 

Nucleic acid arrays are particularly useful for detecting the expression of the genes of the 
present invention. The production and application of high-density arrays in gene expression 
monitoring have been disclosed previously in, for example, WO 97/10365; WO 92/10588; U.S. 
Patent No. 6,040,138; U.S. 5,445,934; or WO95/35505, all of which are incorporated herein by 
reference in their entireties. Also for examples of arrays, see Hacia et al. (1996) Nature Genetics 
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14:441-447; Lockhart et al. (1996) Nature Biotechnol. 14:1675-1680; and De Risi et al. (1996) 
Nature Genetics 14:457-460. In general, in an array, an oligonucleotide, a cDNA, or genomic DNA, 
that is a portion of a known gene occupies a known location on a substrate. A nucleic acid target 
sample is hybridized with an array of such oligonucleotides and then the amount of target nucleic 
acids hybridized to each probe in the array is quantified. One preferred quantifying method is to use 
confocal microscope and fluorescent labels. The Affymetrix GeneChip™ Array system (Affymetrix, 
Santa Clara, Calif.) and the Atlas™ Human cDNA Expression Array system are particularly suitable 
for quantifying the hybridization; however, it will be apparent to those of skill in the art that any 
similar systems or other effectively equivalent detection methods can also be used. In a particularly 
preferred embodiment, one can use the knowledge of the genes described herein to design novel 
arrays of polynucleotides, cDNAs or genomic DNAs for screening methods described herein. Such 
novel pluralities of polynucleotides are contemplated to be a part of the present invention and are 
described in detail below. 

Suitable nucleic acid samples for screening on an array contain transcripts of interest or 
nucleic acids derived from the transcripts of interest. As used herein, a nucleic acid derived from 
a transcript refers to a nucleic acid for whose synthesis the mRNA transcript or a subsequence 
thereof has ultimately served as a template. Thus, a cDNA reverse transcribed from a transcript, an 
RNA transcribed from that cDNA, a DNA amplified from the cDNA, an RNA transcribed from the 
amplified DNA, etc., are all derived from the transcript and detection of such derived products is 
indicative of the presence and/or abundance of the original transcript in a sample. Thus, suitable 
samples include, but are not limited to, transcripts of the gene or genes, cDNA reverse transcribed 
from the transcript, cRNA transcribed from the cDNA, DNA amplified from the genes, RNA 
transcribed from amplified DNA, and the like. Preferably, the nucleic acids for screening are 
obtained from a homogenate of cells or tissues or other biological samples. Preferably, such sample 
is a total RNA preparation of a biological sample. More preferably in some embodiments, such a 
nucleic acid sample is the total mRNA isolated from a biological sample. Biological samples may 
be of any biological tissue or fluid or cells from any organism. Frequently the sample will be a 
"clinical sample" which is a sample derived from a patient, such as a lung tumor sample from a 
patient. Typical clinical samples include, but are not limited to, sputum, blood, blood cells (e.g., 
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white cells), tissue or fine needle biopsy samples, urine, peritoneal fluid, and pleural fluid, or cells 
therefrom. Biological samples may also include sections of tissues, such as frozen sections or 
formalin fixed sections taken for histological purposes. 

In one embodiment, it is desirable to amplify the nucleic acid sample prior to hybridization. 
One of skill in the art will appreciate that whatever amplification method is used, if a quantitative 
result is desired, care must be taken to use a method that maintains or controls for the relative 
frequencies of the amplified nucleic acids to achieve quantitative amplification. Methods of 
"quantitative" amplification are well known to those of skill in the art. For example, quantitative 
PCR involves simultaneously co-amplifying a known quantity of a control sequence using the same 
primers. This provides an internal standard that may be used to calibrate the PCR reaction. The high- 
density array may then include probes specific to the internal standard for quantification of the 
amplified nucleic acid. Other suitable amplification methods include, but are not limited to 
polymerase chain reaction (PCR) Innis, et al., PCR Protocols. A guide to Methods and Application. 
Academic Press, Inc. San Diego, (1990)), ligase chain reaction (LCR) (see.Wu and Wallace, 
Genomics, 4: 560 (1989), Landegren, et al., Science, 241 : 1077 (1988) and Barringer, et al., Gene, 
89: 117 (1990), transcription amplification (Kwoh, et al., Proc. Natl. Acad. Sci. USA, 86: 1173 

(1 989) ), and self-sustained sequence replication (Guatelli, et al, Proc. Nat. Acad. Sci. USA, 87: 1 874 

(1990) ). 

Nucleic acid hybridization simply involves contacting a probe and target nucleic acid under 
conditions where the probe and its complementary target can form stable hybrid duplexes through 
complementary base pairing. As used herein, hybridization conditions refer to standard hybridization 
conditions under which nucleic acid molecules are used to identify similar nucleic acid molecules. 
Such standard conditions are disclosed, for example, in Sambrook et al., Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Labs Press, 1989. Sambrook et al., ibid., is incorporated 
by reference herein in its entirety (see specifically, pages 9.31-9.62). In addition, formulae to 
calculate the appropriate hybridization and wash conditions to achieve hybridization permitting 
varying degrees of mismatch of nucleotides are disclosed, for example, in Meinkoth et al., 1984, 
Anal. Biochem. 138, 267-284; Meinkoth et al., ibid., is incorporated by reference herein in its 
entirety. Nucleic acids that do not form hybrid duplexes are washed away from the hybridized 
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nucleic acids and the hybridized nucleic acids can then be detected, typically through detection of 
an attached detectable label. It is generally recognized that nucleic acids are denatured by increasing 
the temperature or decreasing the salt concentration of the buffer containing the nucleic acids. Under 
low stringency conditions (e.g., low temperature and/or high salt) hybrid duplexes (e.g., DNA:DNA, 
RNA:RNA, or RNA:DNA) will form even where the annealed sequences are not perfectly 
complementary. Thus specificity of hybridization is reduced at lower stringency. Conversely, at 
higher stringency (e.g., higher temperature or lower salt) successful hybridization requires fewer 
mismatches. 

High stringency hybridization and washing conditions, as referred to herein, refer to 
conditions which permit isolation of nucleic acid molecules having at least about 90% nucleic acid 
sequence identity with the nucleic acid molecule being used to probe in the hybridization reaction 
(i.e., conditions permitting about 10% or less mismatch of nucleotides). One of skill in the art can 
use the formulae in Meinkoth et al., 1984, Anal. Biochem. 138, 267-284 (incorporated herein by 
reference in its entirety) to calculate the appropriate hybridization and wash conditions to achieve 
these particular levels of nucleotide mismatch. Such conditions will vary, depending on whether 
DNA:RNA or DNA:DNA hybrids are being formed. Calculated melting temperatures for 
DNA:DNA hybrids are 1 0°C less than for DNA:RNA hybrids. In particular embodiments, stringent 
hybridization conditions for DNA:DNA hybrids include hybridization at an ionic strength of 6X SSC 
(0.9 M Na + ) at a temperature of between about 20°C and about 35°C, more preferably, between 
about 28°C and about 40°C, and even more preferably, between about 35°C and about 45°C. In 
particular embodiments, stringent hybridization conditions for DNA:RNA hybrids include 
hybridization at an ionic strength of 6X SSC (0.9 M Na + ) at a temperature of between about 30°C 
and about 45°C, more preferably, between about 38°C and about 50°C, and even more preferably, 
between about 45°C and about 55°C. These values are based on calculations of a melting 
temperature for molecules larger than about 100 nucleotides, 0% formamide and a G + C content of 
about 40%. Alternatively, T m can be calculated empirically as set forth in Sambrook et al., supra, 
pages 9.31 to 9.62. 

The hybridized nucleic acids are detected by detecting one or more labels attached to the 
sample nucleic acids. The labels may be incorporated by any of a number of means well known to 
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those of skill in the art. Detectable labels suitable for use in the present invention include any 
composition detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, 
optical or chemical means. Useful labels in the present invention include biotin for staining with 
labeled streptavidin conjugate, magnetic beads (e.g., Dynabeads.TM.), fluorescent dyes (e.g., 
fluorescein, texas red, rhodamine, green fluorescent protein, and the like), radiolabels (e.g., 3 H, 125 I, 
35 S, 14 C, or 32 P), en2ymes (e.g., horse radish peroxidase, alkaline phosphatase and others commonly 
used in an ELISA), and colorimetric labels such as colloidal gold or colored glass or plastic (e.g., 
polystyrene, polypropylene, latex, etc.) beads. Means of detecting such labels are well known to 
those of skill in the art. Thus, for example, radiolabels may be detected using photographic film or 
scintillation counters, fluorescent markers may be detected using a photodetector to detect emitted 
light. Enzymatic labels are typically detected by providing the enzyme with a substrate and detecting 
the reaction product produced by the action of the enzyme on the substrate, and colorimetric labels 
are detected by simply visualizing the colored label. 

The term "quantifying" or "quantitating" when used in the context of quantifying 
transcription levels of a gene can refer to absolute or to relative quantification. Absolute 
quantification may be accomplished by inclusion of known concentration(s) of one or more target 
nucleic acids and referencing the hybridization intensity of unknowns with the known target nucleic 
acids (e.g. through generation of a standard curve). Alternatively, relative quantification can be 
accomplished by comparison of hybridization signals between two or more genes, or between two 
or more treatments to quantify the changes in hybridization intensity and, by implication, 
transcription level. 

In one aspect of the present method, in vitro cell based assays may be designed to screen for 
compounds that affect the regulation of genes at either the transcriptional or translational level. One, 
two or more promoters of the genes of this invention can be used to screen unknown compounds for 
activity on a given target. Promoters of the selected genes can be linked to any of several reporters 
(including but not limited to chloramphenicol acetyl transferase, or luciferase) that measure 
transcriptional read-out. The promoters can be tested as pure DNA, or as DNA bound to chromatin 
proteins. 

In one aspect of the present method, the step of detecting can include detecting the expression 
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of one or more genes of the invention in intact animals or tissues obtained from such animals. 
Mammalian (i.e. mouse, rat, monkey) or non-mammalian (ie. chicken) species can be the test 
animals. Sample tissues from a patient can also be screened. The tissues to be surveyed can be 
either normal or malignant tissues. The presence and quantity of endogenous mRNA or protein 
expression of one or more of the genes of this invention can be measured in those tissues. The gene 
markers can be measured in tissues that are fresh, frozen, fixed or otherwise preserved. They can 
be measured in cytoplasmic or nuclear organ-, tissue- or cell-extracts; or in cell membranes including 
but not limited to plasma, cytoplasmic, mitochondrial, golgi or nuclear membranes; in the nuclear 
matrix; or in cellular organelles and their extracts including but not limited to ribosomes, nuclei, 
nucleoli, mitochondria, or golgi. Assays for endogenous expression of mRNAs or proteins encoded 
by the genes of this invention can be performed as described above. Alternatively, intact transgenic 
animals can be generated for screening for research or validation purposes. 

Preferably, a gene identified as being upregulated or downregulated in a test cell according 
to the invention (including a sample tumor cell to be screened) is regulated in the same direction and 
to at least about 10%, and more preferably at least 20%, and more preferably at least 25%, and more 
preferably at least 30%, and more preferably at least 35%, and more preferably at least 40%, and 
more preferably at least 45%, and more preferably at least 50%, and preferably at least 55%, and 
more preferably at least 60%, and more preferably at least 65%, and more preferably at least 70%, 
and more preferably at least 75%, and more preferably at least 80%, and more preferably at least 
85%, and more preferably at least 90%, and more preferably at least 95%, of the level of expression 
of the gene that is seen in established or confirmed gefitinib-sensitive or gefitinib-resistant cells. 
Statistical significance should be at least p<0.05. 

It will be appreciated by those of skill in the art that differences between the expression of 
genes in sensitive versus resistant cells may be small or large. Some small differences may be very 
reproducible and therefore nonetheless useful. For other purposes, large differences may be 
desirable for ease of detection of the activity. It will be therefore appreciated that the exact boundary 
between what is called a positive result and a negative result can shift, depending on the goal of the 
screening assay and the genes to be screened. For some assays it may be useful to set threshold 
levels of change. One of skill in the art can readily determine the criteria for screening of cells given 
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the information provided herein. 

The presence and quantity of each gene marker can be measured in primary tumors, 
metastatic tumors, locally recurring tumors, ductal carcinomas in situ, or other tumors. The markers 
can be measured in solid tumors that are fresh, frozen, fixed or otherwise preserved. They can be 
measured in cytoplasmic or nuclear tumor extracts; or in tumor membranes including but not limited 
to plasma, mitochondrial, golgi or nuclear membranes; in the nuclear matrix; or in tumor cell 
organelles and their extracts including but not limited to ribosomes, nuclei, mitochondria, golgi. 

A profile of individual gene markers, including a matrix of two or more markers, can be 
generated by one or more of the methods described above. According to the present invention, a 
profile of the genes in a tissue sample refers to a reporting of the expression level of a given gene 
from Table 1, and includes a classification of the gene with regard to how the gene is regulated in 
gefitinib-sensitive versus gefitinib-resistant cells. The data can be reported as raw data, and/or 
statistically analyzed by any of a variety of methods, and/or combined with any other prognostic 
marker(s). 

Another embodiment of the present invention relates to a plurality of polynucleotides for the 
detection of the expression of genes as described herein. The plurality of polynucleotides consists 
of polynucleotide probes that are complementary to RNA transcripts, or nucleotides derived 
therefrom, of genes listed in Table 1 or otherwise identified as being useful according to the present 
invention, and is therefore distinguished from previously known nucleic acid arrays and primer sets. 
The plurality of polynucleotides within the above-limitation includes at least one or more, but is not 
limited to one or more, polynucleotide probes that are complementary to RNA transcripts, or 
nucleotides derived therefrom, of genes identified by the present inventors and listed in Table 1 . 

In one embodiment, it is contemplated that additional genes that are not regulated differently 
in gefitinib-sensitive versus gefitinib-resistant cells can be added to the plurality of polynucleotides. 
Such genes would not be random genes, or large groups of unselected human genes, as are 
commercially available now, but rather, would be specifically selected to complement the sets of 
genes identified by the present invention. For example, one of skill in the art may wish to add to the 
above-described plurality of genes one or more genes that are of relevance because they are 
expressed by a particular tissue of interest (e.g., lung tissue), are associated with a particular disease 
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or condition of interest (e.g., NSCLC), or are associated with a particular cell, tissue or body function 
(e.g., angiogenesis). The development of additional pluralities of polynucleotides (and antibodies, 
as disclosed below), which include both the above-described plurality and such additional selected 
polynucleotides, are explicitly contemplated by the present invention. 

According to the present invention, a plurality of polynucleotides refers to at least 2, and 
more preferably at least 3, and more preferably at least 4, and more preferably at least 5, and more 
preferably at least 6, and more preferably at least 7, and more preferably at least 8, and more 
preferably at least 9, and more preferably at least 10, and so on, in increments of one, up to any 
suitable number of polynucleotides, including at least 1 00, 500, 1 000, 1 0 4 , 1 0 5 , or at least 1 0 6 or more 
polynucleotides. 

In one embodiment, the polynucleotide probes are conjugated to detectable markers. 
Detectable labels suitable for use in the present invention include any composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical means. 
Useful labels in the present invention include biotin for staining with labeled streptavidin conjugate, 
magnetic beads (e.g., Dynabeads.TM.), fluorescent dyes (e.g., fluorescein, texas red, rhodamine, 
green fluorescent protein, and the like), radiolabels (e.g., 3 H, 125 1, 35 S, 14 C, or 32 P), enzymes (e.g., 
horse radish peroxidase, alkaline phosphatase and others commonly used in an ELISA), and 
colorimetric labels such as colloidal gold or colored glass or plastic (e.g., polystyrene, polypropylene, 
latex, etc.) beads. Preferably, the polynucleotide probes are immobilized on a substrate. 

In one embodiment, the polynucleotide probes are hybridizable array elements in a 
microarray or high density array. Nucleic acid arrays are well known in the art and are described for 
use in comparing expression levels of particular genes of interest, for example, in U.S. Patent No. 
6, 1 77,248, which is incorporated herein by reference in its entirety. Nucleic acid arrays are suitable 
for quantifying a small variations in expression levels of a gene in the presence of a large population 
of heterogeneous nucleic acids. Knowing the identity of the genes of the present invention, nucleic 
acid arrays can be fabricated either by de novo synthesis on a substrate or by spotting or transporting 
nucleic acid sequences onto specific locations of substrate. Nucleic acids are purified and/or isolated 
from biological materials, such as a bacterial plasmid containing a cloned segment of sequence of 
interest. It is noted that all of the genes identified by the present invention have been previously 
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sequenced, at least in part, such that oligonucleotides suitable for the identification of such nucleic 
acids can be produced. The database accession number for each of the genes identified by the 
present inventors is provided in Table 1 . Suitable nucleic acids are also produced by amplification 
of template, such as by polymerase chain reaction or in vitro transcription. 

Synthesized oligonucleotide arrays are particularly preferred for this aspect of the invention. 
Oligonucleotide arrays have numerous advantages, as opposed to other methods, such as efficiency 
of production, reduced intra- and inter array variability, increased information content and high 
signal-to-noise ratio. 

One of skill in the art will appreciate that an enormous number of array designs are suitable 
for the practice of this invention. An array will typically include a number of probes that specifically 
hybridize to the sequences of interest. In addition, in a preferred embodiment, the array will include 
one or more control probes. The high-density array chip includes "test probes." Test probes could 
be oligonucleotides that range from about 5 to about 45 or 5 to about 500 nucleotides, more 
preferably from about 10 to about 40 nucleotides and most preferably from about 15 to about 40 
nucleotides in length. In other particularly preferred embodiments the probes are 20 or 25 nucleotides 
in length. In another preferred embodiments, test probes are double or single strand DN A sequences. 
DNA sequences are isolated or cloned from natural sources or amplified from natural sources using 
natural nucleic acids as templates, or produced synthetically. These probes have sequences 
complementary to particular subsequences of the genes whose expression they are designed to detect. 
Thus, the test probes are capable of specifically hybridizing to the target nucleic acid they are to 
detect. 

Another embodiment of the present invention relates to a plurality of antibodies, or antigen 
binding fragments thereof, for the detection of the expression of genes according to the present 
invention. The plurality of antibodies, or antigen binding fragments thereof, consists of antibodies, 
or antigen binding fragments thereof, that selectively bind to proteins encoded by genes described 
herein. According to the present invention, a plurality of antibodies, or antigen binding fragments 
thereof, refers to at least 2, and more preferably at least 3, and more preferably at least 4, and more 
preferably at least 5, and more preferably at least 6, and more preferably at least 7, and more 
preferably at least 8, and more preferably at least 9, and more preferably at least 10, and so on, in 
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increments of one, up to any suitable number of antibodies, or antigen binding fragments thereof, 
including at least 100, 500, or at least 1000 antibodies, or antigen binding fragments thereof. 

The invention also extends to non-antibody polypeptides, sometimes referred to as binding 
partners or antigen binding peptides, that have been designed to bind specifically to, and either 
activate or inhibit as appropriate, a target protein. Examples of the design of such polypeptides, 
which possess a prescribed ligand specificity are given in Beste et al. (Proc. Natl. Acad. Sci. 
96:1898-1903, 1999), incorporated herein by reference in its entirety. 

Limited digestion of an immunoglobulin with a protease may produce two fragments. An 
antigen binding fragment is referred to as an Fab, an Fab', or an F(ab') 2 fragment. A fragment lacking 
the ability to bind to antigen is referred to as an Fc fragment. An Fab fragment comprises one arm 
of an immunoglobulin molecule containing a L chain (V L + C L domains) paired with the V H region 
and a portion of the C H region (CHI domain). An Fab' fragment corresponds to an Fab fragment 
with part of the hinge region attached to the CHI domain. An F(ab') 2 fragment corresponds to two 
Fab' fragments that are normally covalently linked to each other through a di-sulfide bond, typically 
in the hinge regions. 

Isolated antibodies of the present invention can include serum containing such antibodies, 
or antibodies that have been purified to varying degrees. Whole antibodies of the present invention 
can be polyclonal or monoclonal. Alternatively, functional equivalents of whole antibodies, such 
as antigen binding fragments in which one or more antibody domains are truncated or absent (e.g., 
Fv, Fab, Fab', or F(ab) 2 fragments), as well as genetically-engineered antibodies or antigen binding 
fragments thereof, including single chain antibodies or antibodies that can bind to more than one 
epitope (e.g., bi-specific antibodies), or antibodies that can bind to one or more different antigens 
(e.g., bi- or multi-specific antibodies), may also be employed in the invention. 

Generally, in the production of an antibody, a suitable experimental animal, such as, for 
example, but not limited to, a rabbit, a sheep, a hamster, a guinea pig, a mouse, a rat, or a chicken, 
is exposed to an antigen against which an antibody is desired. Typically, an animal is immunized 
with an effective amount of antigen that is injected into the animal. An effective amount of antigen 
refers to an amount needed to induce antibody production by the animal. The animal's immune 
system is then allowed to respond over a pre-determined period of time. The immunization process 
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can be repeated until the immune system is found to be producing antibodies to the antigen. In order 
to obtain polyclonal antibodies specific for the antigen, serum is collected from the animal that 
contains the desired antibodies (or in the case of a chicken, antibody can be collected from the eggs). 
Such serum is useful as a reagent. Polyclonal antibodies can be further purified from the serum (or 
eggs) by, for example, treating the serum with ammonium sulfate. 

Monoclonal antibodies may be produced according to the methodology of Kohler and 
Milstein {Nature 256:495-497, 1975). For example, B lymphocytes are recovered from the spleen 
(or any suitable tissue) of an immunized animal and then fused with myeloma cells to obtain a 
population of hybridoma cells capable of continual growth in suitable culture medium. Hybridomas 
producing the desired antibody are selected by testing the ability of the antibody produced by the 
hybridoma to bind to the desired antigen. 

Finally, any of the genes of this invention, or their RNA or protein products, can serve as 
targets for therapeutic strategies. For example, neutralizing antibodies could be directed against one 
of the protein products of a selected gene, expressed on the surface of a tumor cell. Alternatively 
regulatory compounds that regulate (e.g., upregulate or downregulate) the expression and/or 
biological activity of a target gene (whether the product is intracellular, membrane or secreted), can 
be identified and/or designed using the genes described herein. 

If a suitable therapeutic compound is identified using the methods and genes of the present 
invention, a composition can be formulated. A composition, and particularly a therapeutic 
composition, of the present invention generally includes the therapeutic compound and a carrier, and 
preferably, a pharmaceutically acceptable carrier. According to the present invention, a 
"pharmaceutically acceptable carrier" includes pharmaceutically acceptable excipients and/or 
pharmaceutically acceptable delivery vehicles, which are suitable for use in administration of the 
composition to a suitable in vitro, ex vivo or in vivo site. A suitable in vitro, in vivo or ex vivo site 
is preferably a tumor cell. In some embodiments, a suitable site for delivery is a site of 
inflammation, near the site of a tumor, or a site of any other disease or condition in which regulation 
of the genes identified herein can be beneficial. Preferred pharmaceutically acceptable carriers are 
capable of maintaining a compound, a protein, a peptide, nucleic acid molecule or mimetic (drug) 
according to the present invention in a form that, upon arrival of the compound, protein, peptide, 
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nucleic acid molecule or mimetic at the cell target in a culture or in patient, the compound, protein, 
peptide, nucleic acid molecule or mimetic is capable of interacting with its target. 

Suitable excipients of the present invention include excipients or formularies that transport 
or help transport, but do not specifically target a composition to a cell (also referred to herein as non- 
targeting carriers). Examples of pharmaceutically acceptable excipients include, but are not limited 
to water, phosphate buffered saline, Ringer's solution, dextrose solution, serum-containing solutions, 
Hank's solution, other aqueous physiologically balanced solutions, oils, esters and glycols. Aqueous 
carriers can contain suitable auxiliary substances required to approximate the physiological 
conditions of the recipient, for example, by enhancing chemical stability and isotonicity. 

Suitable auxiliary substances include, for example, sodium acetate, sodium chloride, sodium 
lactate, potassium chloride, calcium chloride, and other substances used to produce phosphate buffer, 
Tris buffer, and bicarbonate buffer. Auxiliary substances can also include preservatives, such as 
thimerosal, m- or o-cresol, formalin and benzol alcohol. Compositions of the present invention can 
be sterilized by conventional methods and/or lyophilized. 

One type of pharmaceutically acceptable carrier includes a controlled release formulation that 
is capable of slowly releasing a composition of the present invention into a patient or culture. As 
used herein, a controlled release formulation comprises a compound of the present invention (e.g., 
a protein (including homologues), a drug, an antibody, a nucleic acid molecule, or a mimetic) in a 
controlled release vehicle. Suitable controlled release vehicles include, but are not limited to, 
biocompatible polymers, other polymeric matrices, capsules, microcapsules, microparticles, bolus 
preparations, osmotic pumps, diffusion devices, liposomes, lipospheres, and transdermal delivery 
systems. Other carriers of the present invention include liquids that, upon administration to a patient, 
form a solid or a gel in situ. Preferred carriers are also biodegradable (i.e., bioerodible). When the 
compound is a recombinant nucleic acid molecule, suitable delivery vehicles include, but are not 
limited to liposomes, viral vectors or other delivery vehicles, including ribozymes. Natural lipid- 
containing delivery vehicles include cells and cellular membranes. Artificial lipid-containing 
delivery vehicles include liposomes and micelles. A delivery vehicle of the present invention can 
be modified to target to a particular site in a patient, thereby targeting and making use of a compound 
of the present invention at that site. Suitable modifications include manipulating the chemical 
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formula of the lipid portion of the delivery vehicle and/or introducing into the vehicle a targeting 
agent capable of specifically targeting a delivery vehicle to a preferred site, for example, a preferred 
cell type. Other suitable delivery vehicles include gold particles, poly-L-lysine/DNA-molecular 
conjugates, and artificial chromosomes. 

A pharmaceutically acceptable carrier which is capable of targeting is herein referred to as 
a "delivery vehicle." Delivery vehicles of the present invention are capable of delivering a 
composition of the present invention to a target site in a patient. A "target site" refers to a site in a 
patient to which one desires to deliver a composition. For example, a target site can be any cell 
which is targeted by direct injection or delivery using liposomes, viral vectors or other delivery 
vehicles, including ribozymes and antibodies. Examples of delivery vehicles include, but are not 
limited to, artificial and natural lipid-containing delivery vehicles, viral vectors, and ribozymes. 
Natural lipid-containing delivery vehicles include cells and cellular membranes. Artificial lipid- 
containing delivery vehicles include liposomes and micelles. A delivery vehicle of the present 
invention can be modified to target to a particular site in a subject, thereby targeting and making use 
of a compound of the present invention at that site. Suitable modifications include manipulating the 
chemical formula of the lipid portion of the delivery vehicle and/or introducing into the vehicle a 
compound capable of specifically targeting a delivery vehicle to a preferred site, for example, a 
preferred cell type. Specifically, targeting refers to causing a delivery vehicle to bind to a particular 
cell by the interaction of the compound in the vehicle to a molecule on the surface of the cell. 
Suitable targeting compounds include ligands capable of selectively (i.e., specifically) binding 
another molecule at a particular site. Examples of such ligands include antibodies, antigens, 
receptors and receptor ligands. Manipulating the chemical formula of the lipid portion of the 
delivery vehicle can modulate the extracellular or intracellular targeting of the delivery vehicle. For 
example, a chemical can be added to the lipid formula of a liposome that alters the charge of the lipid 
bilayer of the liposome so that the liposome fuses with particular cells having particular charge 
characteristics. 

Another preferred delivery vehicle comprises a viral vector. A viral vector includes an 
isolated nucleic acid molecule useful in the present invention, in which the nucleic acid molecules 
are packaged in a viral coat that allows entrance of DNA into a cell. A number of viral vectors can 
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be used, including, but not limited to, those based on alphaviruses, poxviruses, adenoviruses, 
herpesviruses, lentiviruses, adeno-associated viruses and retroviruses. 

A composition can be delivered to a cell culture or patient by any suitable method. Selection 
of such a method will vary with the type of compound being administered or delivered (i.e., 
compound, protein, peptide, nucleic acid molecule, or mimetic), the mode of delivery (i.e., in vitro, 
in vivo, ex vivo) and the goal to be achieved by administration/delivery of the compound or 
composition. According to the present invention, an effective administration protocol (i.e., 
administering a composition in an effective manner) comprises suitable dose parameters and modes 
of administration that result in delivery of a composition to a desired site (i.e., to a desired cell) 
and/or in the desired regulatory event. 

Administration routes include in vivo, in vitro and ex vivo routes. In vivo routes include, but 
are not limited to, oral, nasal, intratracheal injection, inhaled, transdermal, rectal, and parenteral 
routes. Preferred parenteral routes can include, but are not limited to, subcutaneous, intradermal, 
intravenous, intramuscular and intraperitoneal routes. Intravenous, intraperitoneal, intradermal, 
subcutaneous and intramuscular administrations can be performed using methods standard in the art. 
Aerosol (inhalation) delivery can also be performed using methods standard in the art (see, for 
example, Stribling et al., Proc. Natl. Acad. Sci. USA 189:1 1277-1 1281, 1992, which is incorporated 
herein by reference in its entirety). Oral delivery can be performed by complexing a therapeutic 
composition of the present invention to a carrier capable of withstanding degradation by digestive 
enzymes in the gut of an animal. Examples of such carriers, include plastic capsules or tablets, such 
as those known in the art. Direct injection techniques are particularly useful for suppressing graft 
rejection by, for example, injecting the composition into the transplanted tissue, or for site-specific 
administration of a compound, such as at the site of a tumor. Ex vivo refers to performing part of 
the regulatory step outside of the patient, such as by transfecting a population of cells removed from 
a patient with a recombinant molecule comprising a nucleic acid sequence encoding a protein 
according to the present invention under conditions such that the recombinant molecule is 
subsequently expressed by the transfected cell, and returning the transfected cells to the patient. In 
vitro and ex vivo routes of administration of a composition to a culture of host cells can be 
accomplished by a method including, but not limited to, transfection, transformation, electroporation, 
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microinjection, lipofection, adsorption, protoplast fusion, use of protein carrying agents, use of ion 
carrying agents, use of detergents for cell permeabilization, and simply mixing (e.g., combining) a 
compound in culture with a target cell. 

In the method of the present invention, a therapeutic compound, as well as compositions 
comprising such compounds, can be administered to any organism, and particularly, to any member 
of the Vertebrate class, Mammalia, including, without limitation, primates, rodents, livestock and 
domestic pets. Livestock include mammals to be consumed or that produce useful products (e.g., 
sheep for wool production). Preferred mammals to protect include humans. Typically, it is desirable 
to obtain a therapeutic benefit in a patient. A therapeutic benefit is not necessarily a cure for a 
particular disease or condition, but rather, preferably encompasses a result which can include 
alleviation of the disease or condition, elimination of the disease or condition, reduction of a 
symptom associated with the disease or condition, prevention or alleviation of a secondary disease 
or condition resulting from the occurrence of a primary disease or condition, and/or prevention of 
the disease or condition. As used herein, the phrase "protected from a disease" refers to reducing 
the symptoms of the disease; reducing the occurrence of the disease, and/or reducing the severity of 
the disease. Protecting a patient can refer to the ability of a composition of the present invention, 
when administered to a patient, to prevent a disease from occurring and/or to cure or to alleviate 
disease symptoms, signs or causes. As such, to protect a patient from a disease includes both 
preventing disease occurrence (prophylactic treatment) and treating a patient that has a disease 
(therapeutic treatment) to reduce the symptoms of the disease. A beneficial effect can easily be 
assessed by one of ordinary skill in the art and/or by a trained clinician who is treating the patient. 
The term, "disease" refers to any deviation from the normal health of a mammal and includes a state 
when disease symptoms are present, as well as conditions in which a deviation (e.g., infection, gene 
mutation, genetic defect, etc.) has occurred, but symptoms are not yet manifested. 

Various aspects of the invention are described in the following examples; however, the 
following examples are provided for the purpose of illustration and are not intended to limit the 
scope of the present invention. 
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Example 1 

Methods: Gefitinib sensitivity was determined in 1 8 NSCLC cell lines using MTT assays. Cell lines 
were classified as gefitinib sensitive (IC< 1 |i.M), resistant (IC50>10^M) or intermediate sensitivity 
(IC50>l,<10u.M). Oligonucleotide gene arrays (Affymetrix® HGU 133 A, 22.000 genes) were done 
on 10 cell lines. Three distinct filtration and normalization algorithms to process the expression data 
were used, and a list of genes were generated that were both statistically significant (unadjusted 
p=0.001 cutoff) and corrected for false positive occurrence. This approach was used in combination 
with 5 distinct machine learning algorithms used to build a test set for predictor genes that were 
successful for 100% of the test cases. The best discriminators (>3 fold difference in expression 
between sensitive and resistant cell lines) were selected for Real-time RT-PCR. 
Results: A list of 144/210 genes was generated initially from the Affymetric array analysis. By 
using the mathematical algorithm 14 different candidate genes were selected for RT-PCR. Twelve 
of the 14 genes were verified to discriminate between sensitive and resistant cell lines by Real-time 
RT-PCR. 

Conclusion: Based on NSCLC cell line studies it was possible to identify 12 different genes, which 
strongly discriminated gefitinib (Iressa) sensitive cell lines from the resistant ones. The genes are 
ranked in Table 1, with the top 12 genes listed first. This entire biomarker panel is of significant 
value for selecting NSCLC patients for gefitinib treatment. 
Example 2 

The attached report, labeled as "Example 2" describes the identification and further 
investigation of a target gene identified using the gene expression profile disclosed herein. 

While various embodiments of the present invention have been described in detail, it is 
apparent that modifications and adaptations of those embodiments will occur to those skilled in the 
art. It is to be expressly understood, however, that such modifications and adaptations are within the 
scope of the present invention, as set forth in the following claims. 
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Example 2 

In this study the present inventors describe research to examine the influence of E- 
cadherin-regulatory molecules on non-small cell lung cancer (NSCLC) response to EGF receptor 
(EGFR) inhibitors. The EGFR, a member of the erbB family of tyrosine kinases (erbBl-4) plays 
a major role in transmitting stimuli that lead to NSCLC cellular proliferation and survival. 
EGFR, highly expressed in NSCLC, is a primary target for NSCLC therapeutic intervention. In 
clinical trials, 11-20% of patients with chemo-refractory advanced stage NSCLC responded to 
treatment with EGFR inhibitors such as gefitinib (Iressa®, ZD 1839). Currently, there are no 
markers that predict which patients will respond to treatment. NSCLC patients with poor survival 
have decreased expression of E-cadherin, a cell adhesion molecule. E-cadherin expression is 
regulated by the wnt pathway and by zinc finger transcription factors including 5EF1/ZEB1 and 
SIP1/ZEB2. Higher levels of protein expression of E-cadherin were detected in gefitinib 
sensitive NSCLC cell lines and expression was absent in gefitinib resistant lines. Conversely, 
expression of the E-cadherin inhibitors ZEB1 and SIP1 was higher in gefitinib resistant cell 
lines. The Hypothesis of this project is that expression of E-cadherin and its regulatory molecules 
predict response to EGFR inhibitors, and modulating E-cadherin regulatory proteins may 
augment response to EGFR inhibitors in non-small cell lung cancer. 




Chemotherapy is the mainstay of treatment for lung cancer, the leading cause of cancer deaths in 
men and women in the US and though out the world 1 . 
However less than a third of patients with advanced stages 
of NSCLC respond to the best two-chemotherapy drug 
combination. 2 Therefore novel agents that target cancer 
specific biological pathways are needed. The EGFR is one 
of the most appealing targets for novel therapies. EGFR 
plays a major role in transmitting stimuli that lead to 
proliferation, growth and survival of various cancer types 
including NSCLC. Ligand binding to the EGFR receptor 
leads to homo- or heterodimerization of EGFR with other 
ErbB receptors. 3 EGFR is overexpressed in a large 
proportion of invasive NSCLC and in premalignant 
bronchial lesions. Activation of the EGFR receptor leads to simultaneous activation of several 
signaling cascades including the MAPK pathway, the protein kinase C (PKC) pathway and the 
PI(3)K-activated AKT pathway (Figure 1). EGFR signaling translated in the nucleus leads to 
cancer cell proliferation and survival. 3 

In phase II clinical trials, 11-20% of patients with chemo-refractory advanced stage 
NSCLC responded to treatment with the EGFR tyrosine kinease inhibitor gefitinib 4,5 . A trial 
evaluating the activity of the EGFR inhibitor, erlotinib (Tarceva®, OSI-774) has been completed 
and the results will be reported in the near future. A retrospective analysis of 140 patients 
responding to treatment with gefitinib revealed that the presence of BAC features (p=0.005) and 
being a never smoker (p=0.007) were the only independent predictors of response to gefitinib 6 . 
These data suggest that EGFR inhibitor therapy is more active in BAC and in non-smokers. 

Despite the correlation of tumor histology and smoking history with gefitinib response, it 
is of great importance to identify molecular molecules that influence gefitinib responsiveness, 
and to develop adjuvant treatments that enhance the response. To accomplish this goal, it is 
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critical that we understand the aspects of EGFR signaling and which molecules interacting with 
the EGFR pathway dictate responsiveness. 

E-cadherin, a calcium-dependent epithelial cell 
adhesion molecule, plays an important role in tumor 
invasiveness and metastatic potential 7 " 10 . Reduced E- 
cadherin expression is associated with tumor cell 
dedifferentiation, advanced stage and reduced survival in 
patients with NSCLC 11 ' 12 . At the transcriptional level, the 
wnt/P-catenin signaling pathway regulates DE-cadherin 
^8i|Saf expression 13 . Our group recently reported that, inhibition 

I 1 wHb^H9 of GSK3P, involved in the proteasomal degradation of P- 
y V J[ catenin, lead E-cadherin upregulation 14 (Figure 2). E- 

cadherin transcription is also regulated by zinc finger 
transcription factors including, Snail, Slug, ZEB1 and 




Figure 2: E-cadherin regulation 

gjpi '5-17 They repress E-cadherin expression by binding to its promoter and recruiting 



HDAC 17 (Figure 2). We recently reported that inhibiting the ZEB1 or HDAC expression lead to 
upregulation of E-cadherin in NSCLC cell lines 14 . 

We used NSCLC cell lines to: (1) evaluate the growth inhibitory properties of EGFR 
inhibitors by MTT assays, (2) to identify molecular molecules through DNA microarrays and 
westerns that predict response to EGFR inhibitors and (3) to design combination therapies that 
enhance the effect of the EGFR inhibitors. Cell lines were screened for expression of members of 
the EGFR and Wnt signaling pathways. We found that E-cadherin expression is lacking in 
gefitinib resistant cell lines and activated in gefitinib sensitive lines. Therefore we investigated 
the expression of zinc finger transcription factors involved in E-cadherin repression. We found 
that gefitinib resistant lines have high ZEB1 and/or SIP1 expression, and expression is lacking in 
gefitinib-sensitive lines. 

Our hypothesis is that SIP1 and ZEB1 expression predicts EGFR tyrosine kinase 
inhibitors resistance and that modulating the molecular mechanism that regulate E-cadherin 
expression will enhance sensitivity to EGFR inhibitors. I will test this hypothesis by 
manipulating E-cadherin expression and measuring the effect on response to gefitinib. Results of 
this work will be evaluated in clinical trials in patients with NSCLC. 
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Results 

I. EGFR, pEGFR, Her2, ErbB3 and Erb4 expression in NSCLC: EGFR, Her-2 and ErbB3 

cell surface expression was evaluated using flow 

IT! " CS>U " cytometry (Table 1). The majority of NSCLC cell lines 

c^un. mao9 (15/18) had a high percentage of EGFR positive cells and 

^r"™"™!*™^ iom7 »4. 3 <i tm * ee had low or now EGFR expression. The two BAC 

Co**, o* «o sm* cell lines, H322 and H358, had high expression of EGFR 

ma 9i/2« o« 4.7 andHer2. 

™ ™ a The presence of phosphorylated EGFR (pEGFR) vs 

t^4 ^ " EGFR was evaluated by Western blotting in 18 NSCLC 

*we m 117 cell lines (Figure 3, shows 15 cell lines). EGFR was 

78/27 < "° 11 s detected in the majority of NSCLC cell lines whereas only 

„ „ m 0 a subset of these cell lines had (pEGFR). 




Table 1: EGFR and Her2 expression by flow 
cytometry and immuno-fluorescence (MFI), and 
growth inhibition by the EGFR inhibitor 



Figure 3. Expression of EGFR and 
phosphorylated EGFR in NSCLC cell lines. 



II. Effects of EGFR inhibitors on human lung cancer cells growth: The growth inhibitory 
effect of gefitinib, on 18 NSCLC cell lines was 
evaluated using the MTT assay (Table 1). There was no 
correlation between the EGFR expression and gefitinib 
response. We evaluated the change pEGFR following 
gefitinib treatment in two sensitive cell lines, HI 334 
and H322, and two resistant cell lines, HI 264 and 
H1648 (Figure 4). Gefitinib inhibited the 
phosphorylated "active" form of EGFR in sensitive cell 
lines. 



Based on the in vitro experiments, athymic nude mice 
bearing human NSCLC xenografts were treated with 
EGFR inhibitors ZD 1839 or C225. Growth delay was 
evident in tumors after treatment with either agent 
(Figure 5). 



H1334 H1264 H1648 H322 



Figure 4: ZD 1839 downregulates pEGF 
in sensitive NSCLC cell lines. 




Figure 5: Effects of gefitinib on A549 NSCLC xenografts 
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III. E-cadherin, SIP1 and ZEB1 in NSCLC cell lines using microarray and RT-PCR and 
western blotting. High density oligonucleotide microarray (IOAM) analysis of gene expression 
levels of selected genes was developed from 1 1 NSCLC cell 
lines. These cell lines included 2 gefitinib sensitive lines 
(IC50<luM), 5 gefitinib resistant lines (IC50> lOuM), and 4 
lines with intermediate sensitivity (IC50 >l[iM <10uM). 
The expression of E-cadherin, SIP1 and ZEB 1 was 
evaluated and compared to their expression in normal 
bronchial epithelium using the Gene Spring program (Figure 
6). 

E-cadherin expression was more pronounced in gefitinib 
sensitive lines absent in gefitinib resistant lines. This 
expression pattern was confirmed using western blotting 
and real time PCR (RT-PCR) (Figure 7). 

As discussed above, regulation of E-cadherin 
expression involves the zinc finger transcription factors 
ZEB1 and SJP1. Expression of both transcription factors 
was evaluated using real time RT-PCR. ZEB 1 and SIP 1 
were expressed in the gefitinib resistant lines and absent in 
the gefitinib sensitive lines (Figure 8). We also evaluated the expression of Slug, Snail, Wnt7a, 
P-catenin, y-catein, ot-catenin and GSK3P using Western blot analysis or RT-PCR. None of 
theses molecules had a differential pattern of expression in the NSCLC lines (data not shown). 



Cell line (ZD18J9 IC50) 

Figure 6: Expression of E-cadherin, i 
NSCLC cell lines using GeneSprin 
analysis of microarrays. Expression i 
compared in cell lines to their expressio 

s I a k g s a s § 1 3 § § § £ | a 



Figure 7: Western blott analysis of E- 
cadherin expression in NSCLC cell lines 




^ ^ //// #* / ^ ^ / 




Figure 8: Real 
time RT-PCR 
analysis of 
ZEB1 and SIP 1 
expression in 
NSCLC cell 
lines. 



In summary: There was no correlation between gefitinib sensitivity and EGFR expression. E- 
cadherin was detected preferentially in gefitinib sensitive lines. Conversely, the zinc finger 
transcription factors, ZEB1 and SIP1, involved in E-cadherin inhibition were expressed in 
gefitinib resistant lines and absent in gefitinib sensitive lines. 



Future Research Design and Methods 

Aim 1. Evaluate the detrimental effect of the zinc finger proteins ZEB1 and SIP1 on 
NSCLC cell lines sensitivity to EGFR inhibitors. 

A. adenoviral constracts containing ZEB1 or SIP1 will be used to overexpress these proteins 
in gefitinib sensitive cell lines. MTT assay will assess changes in gefitinib sensitivity. 

B. Stably transfected ZEB1 and SIP1 cell lines and untrasfected cell lines will be implanted 
into nude mice. Transplanted mice will be treated with gefitinib and response will be 
compared between the two groups. 

Aim 2. Determine the molecular mechanisms that improve the response to EGFR 
inhibitors in NSCLC cell lines , w v ( tro and /„ v / w . 
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2 A. Examining the effect of "silencing" the E-cadherin transcriptional repressors, SIP] and 
ZEB1, on NSCLC cell lines response to ZD1839: 

To directly examine the role of the zinc-finger 
transcription factors, SIP1 and ZEB1, on gefitinib 
responsive lines, we will develop and test the effect 
of siRNA. siRNA will be prepared for different 
regions of SIP1 and ZEB1 using the silencer kit 
from Dharmacon (Colorado). Their efficacy will be 
tested by RT-PCR. The most effective siRNA for 
SIP1 and ZEB1 will be introduced, individually or 
in combination, into gefitinib resistant lines. The 
effect of these siRNAs on gefitinib responsiveness 
will be evaluated by MTT assay. ZEB1 antibody 
(Santa Cruz, California) and SIP1 antibody (a gift 
from Dr. van Grunsven) will be used to evaluate the efficacy of RNA inhibition. 

2B. Examining the effect of inhibiting GSK3pon gefitinib response in NSCLC cell lines: 
GSK3P phosphorylate P-catenin leading to its ubiqitination and destruction. GSK3P inhibitors, 
such as lithium, increased E-cadherin expression in NSCLC cell lines 14 . We will inhibit 
GSK3P function with an adenovirus (pAdTrack-CMV) encoding a dominant-negative 
GSK3P (dnGSK3p)(a gift from Dr. Woodgett). To determine the effectiveness of this dnGSK3 
we will evaluate the expression of non-phosphorylated P-catenin and E-cadherin by western blot. 
NSCLC cell lines stably transfected with the dnGSK3P construct will be generated. The effect of 
inhibiting GSK3P on NSCLC cell lines response to gefitinib will be evaluated using MTT 




2C. Evaluating the effect of E-cadherin on gefitinib sensitivity: 

Resistant NSCLC lines will be transfected with E-cadherin encoding constructs. Changes in 
NSCLC cell lines response to gefitinib will be assessed by MTT assay. Gefitinib-sensitive lines 
that express E-cadherin will be treated with an E-cadherin antibody (Zymed) and the effect on 
gefitinib responsiveness assessed by MTT assay. This will answer the question whether 
expression of E-cadherin itself is sufficient to determine gefitinib sensitivity or if sensitivity is a 
reflexion of events occurring upstream of it. 

2D. Augmenting the effect of gefitinib responsiveness on NSCLC cell lines in vivo: 
Based on finding from the above in vitro experiments, the best treatment that enhances gefitinib 
sensitivity in NSCLC cell lines will be selected for in vivo experiments in nude mice. Previously 
we showed an inhibitory effect of gefitinib alone on NSCLC xenografts growth (see above). We 
will test the combination of gefitinib with one of the above-evaluated interventions in athymic 
nude mice bearing human NSCLC xenografts in collaboration with Dr. Daniel Chan. 
E-cadherin inducible cell lines from the in vitro experiments will be inoculated subcutaneously 
in nude mice. Mice will be treated with gefitinib with and without the agent that improved the 
gefitinib sensitivity. The two groups will be evaluated for differences in tumor growth 
inhibition. Expression of E-cadherin, SIP1 and ZEB1 will be evaluated both prior to and post- 
treatment by real-time RT-PCR and immunohistochemistry. ZEB1 antibody (Santa Cruz, 
California) and SDP1 antibody (a gift from Dr. van Grunsven) will be used in the 
immunohistochemistry. However, new antibodies will be generated if the above antibodies were 
not effective at detecting proteins in the IHC assays. 
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In the experimental design outlined above we hope to dissect out the events leading to gefitinib 
resistance develop treatment modifications that bypass resistance. 

Aim III. Conduct clinical trial evaluating the correlation between ZEB1, SIP1 and 
resitance to EGFR inhibitors in patients with NSCLC. 

This trial will enroll 29 patients with diagnosis proven Stage I/II NSCLC. Patients will be treated 
with GW572016 ( a dual EGFR and Her2 inhibitor, GalaxoSmithKline Pharmaceuticals) for 4 
weeks. Surgical removal of the tumor will be done at the end of this treatment period. Using RT- 
PCR analysis and immunohistochemistry, samples will be evaluated before and after treatment 
for ZEB1 and SIP1 expression. Correlation will be made between ZEB1 and SIP1 expression 
and EGFR inhibitor effectiveness. 



32 



Exemplary Claims: 

1 . A method to select a cancer patient who is predicted to benefit from therapeutic 
administration of gefitinib, an agonist thereof, or a drug having substantially similar biological 
activity as gefitinib, comprising: 

a) providing a sample of tumor cells from a patient to be tested; 

b) detecting in the sample the expression of at least one gene selected from the 
genes listed in Table 1 ; 

c) comparing the level of expression of the gene or genes detected in the patient 
sample to the level of expression of the gene in Table 1 that has been correlated with 
sensitivity and/or resistance to gefitinib; 

d) selecting the patient as being predicted to benefit from therapeutic 
administration of gefitinib, an agonist thereof, or a drug having substantially similar 
biological activity as gefitinib, if the expression of the gene or genes in the patient's tumor 
cells is statistically more similar to the expression levels of the gene or genes as has been 
correlated with sensitivity to gefitinib. 

2. The method of Claim 1, wherein the step (b) of detecting comprises detecting 
expression of at least two genes from Table 1. 

3. The method of Claim 1, wherein the step (b) of detecting comprises detecting 
expression of at least three genes from Table 1 . 

4. The method of Claim 1, wherein the step (b) of detecting comprises detecting 
expression of at least four genes from Table 1 . 

5. The method of Claim 1, wherein the step (b) of detecting comprises detecting 
expression of at least five genes from Table 1. 

6. The method of Claim 1, wherein the step (b) of detecting comprises detecting 
expression of at least 10 genes from Table 1. 

7. The method of Claim 1, wherein the step (b) of detecting comprises detecting 
expression of at least 25 genes from Table 1. 

8. The method of Claim 1, wherein the step (b) of detecting comprises detecting 
expression of at least 50 genes from in Table 1. 
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9. The method of Claim 1, wherein the step (b) of detecting comprises detecting 
expression of at least 100 genes from Table 1. 

10. The method of Claim 1, wherein the step (b) of detecting comprises detecting 
expression of at least 150 genes in Table 1. 

11. The method of Claim 1, wherein the step (b) of detecting comprises detecting 
expression of substantially all of the genes in Table 1. 

12. The method of any one of Claims 1-11, wherein expression of the gene or genes is 
detected by measuring amounts of transcripts of the gene in the tumor cells. 

1 3 . The method of any one of Claims 1-11, wherein expression of the gene or genes is 
detected by detecting hybridization of at least a portion of the gene or a transcript thereof to a nucleic 
acid molecule comprising a portion of the gene or a transcript thereof in a nucleic acid array. 

14. The method of any one of Claims 1-11, wherein expression of the gene is detected 
by detecting the production of a protein encoded by the gene. 

1 5 . The method of any one of Claims 1-14, comprising detecting expression of: at least 
one gene selected from the group consisting of: E-cadherin, ErbB3, ZEB1 and SIP1. 

1 6. The method of Claim 1 , comprising comparing the expression of the gene or genes 
to expression of the gene or genes in a cell from a non-cancerous cell of the same type. 

17. The method of Claim 1 , comprising comparing the expression of the gene or genes 
to expression of the gene or genes in an autologous, non-cancerous cell from the patient. 

18. The method of Claim 1 , comprising comparing the expression of the gene or genes 
to expression of the gene or genes in a control cell that is resistant to gefitinib. 

1 9. The method of Claim 1 , comprising comparing the expression of the gene or genes 
to expression of the gene or genes in a control cell that is sensitive to gefitinib. 

20. The method of Claim 1 , wherein control expression levels of the gene or genes that 
has been correlated with sensitivity and/or resistance to gefitinib has been predetermined. 
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21. A method to identify molecules that interact with the EGFR pathway to allow or 
enhance responsiveness to EGFR inhibitors, comprising: 

a) providing a sample of cells that are sensitive or resistant to treatment with 
gefitinib; 

b) detecting the expression of at least one gene in the gefitinib-sensitive cells as 
compared to the level of expression of the gene or genes in the gefitinib-resistant cells; 

c) identifying a gene or genes having a level of expression in gefitinib-sensitive 
cells that is statistically significantly different than the level of expression of the gene or 
genes gefitinib-resistant cells as potentially being a molecule that interacts with the EGFR 
pathway to allow or enhance responsiveness to EGFR inhibitors. 

22. A plurality of polynucleotides for the detection of the expression of genes that are 
indicative of sensitivity or resistance to gefitinib, an agonist thereof, or a drug having substantially 
similar biological activity as gefitinib; 

wherein the plurality of polynucleotides consists of polynucleotide probes that are 
complementary to RNA transcripts, or nucleotides derived therefrom, of one or more genes listed 
in Table 1 . 

23 . The plurality of polynucleotides of Claim 22, wherein the plurality of polynucleotides 
consists of at least two of the genes from Table 1 . 

24. The plurality of polynucleotides of Claim 22, wherein the plurality of polynucleotides 
consists of at least 5 of the genes from Table 1 . 

25 . The plurality of polynucleotides of Claim 22, wherein the plurality of polynucleotides 
consists of at least 10 of the genes from Table 1 . 

26. The plurality of polynucleotides of Claim 22, wherein the plurality of polynucleotides 
consists of at least 25 of the genes from Table 1 . 

27. The plurality of polynucleotides of Claim 22, wherein the plurality of polynucleotides 
consists of at least 50 of the genes from Table 1. 

28. The plurality of polynucleotides of Claim 22, wherein the plurality of polynucleotides 
consists of at least 100 of the genes from Table 1. 

29. The plurality of polynucleotides of Claim 22, wherein the plurality of polynucleotides 
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consists of at least 1 50 of the genes from Table 1 . 

30. The plurality of polynucleotides of Claim 22, wherein the plurality of polynucleotides 
consists of all of the genes from Table 1 . 

31. The plurality of polynucleotides of any one of Claims 22-31, wherein said 
polynucleotide probes are immobilized on a substrate. 

32. The plurality of polynucleotides of any one of Claims 22-31, wherein said 
polynucleotide'probes are hybridizable array elements in a microarray. 

33. The plurality of polynucleotides of any one of Claims 22-31, wherein said 
polynucleotide probes are conjugated to detectable markers. 

34. A plurality of polynucleotides for the detection of the expression of genes that are 
indicative of sensitivity or resistance to gefitinib, an agonist thereof, or a drug having substantially 
similar biological activity as gefitinib; 

wherein the plurality of polynucleotides consists of polynucleotide probes that are 
complementary to RNA transcripts, or nucleotides derived therefrom, of genes that are regulated in 
gefitinib-sensitive tumor cells as compared to gefitinib-resistant cells. 

35. A plurality of antibodies, antigen binding fragments thereof, or antigen binding 
peptides, for the detection of the expression of genes that are indicative of sensitivity or resistance 
to gefitinib, an agonist thereof, or a drug having substantially similar biological activity as gefitinib; 

wherein said plurality of antibodies, antigen binding fragments thereof, or antigen binding 
peptides consists of antibodies, antigen binding fragments thereof, or antigen binding peptides, each 
of which selectively binds to a protein encoded by a gene in Table 1 . 

36. A method to identify a compound with the potential to enhance the efficacy of EGFR 
inhibitors, comprising: 

a) contacting a test compound with a cell that expresses a gene selected from any 
one or more of the genes identified in Table 1 ; 

b) identifying compounds that increase the expression or activity of genes in 
Table 1 or the proteins encoded thereby that are correlated with sensitivity to gefitinib, or that 
decrease the expression or activity of genes in Table 1 or the proteins encoded thereby that 
are correlated with resistance to gefitinib, as compounds with potential to enhance the 
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efficacy of EGFR inhibitors. 

37. The method of Claim 36, wherein the cell expresses a gene encoding E-cadherin or 
ErbB3, and wherein step (b) comprises identifying compounds that increase the expression or 
activity of E-cadherin or ErbB3 or the gene encoding E-cadherin or ErbB3. 

38. The method of Claim 36, wherein the cell expresses a gene encoding ZEB 1 and SIP 1 , 
wherein step (b) comprises identifying compounds that decrease the expression or activity ZEB 1 or 
SIP1 or the gene encoding ZEB1 or SIP1. 

39. A method to treat a patient with a cancer, comprising administering to the patient a 
therapeutic composition comprising a compound identified by the method of Claim 36. 

40. A method to treat a patient with a cancer, comprising administering to the patient a 
therapeutic composition comprising a compound that upregulates the expression or activity of E- 
cadherin or ErbB3 or the gene encoding E-cadherin or ErbB3 in the tumor cells of the patient. 

41 . A method to treat a patient with a cancer, comprising administering to the patient a 
therapeutic composition comprising a compound that downregulates the expression of ZEB 1 or SIP 1 
or the gene encoding ZEB 1 or SIP1 in the tumor cells of the patient. 
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